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ON SHORT SUMS OF TRACE FUNCTIONS 


ETIENNE FOUVRY, EMMANUEL KOWALSKI, PHILIPPE MICHEL, CHANDRA SEKHAR RAJU, 

JOEL RIVAT, AND KANNAN SOUNDARARAJAN 


Abstract. We consider sums of oscillating functions on intervals in cyclic groups of size 
close to the square root of the size of the group. We first prove non-trivial estimates for 
intervals of length slightly larger than this square root (bridging the “Polya-Vinogradov 
gap” in some sense) for bounded functions with bounded Fourier transforms. We then 
prove that the existence of non-trivial estimates for ranges slightly below the square-root 
bound is stable under the discrete Fourier transform. We then give applications related to 
trace functions over finite fields. 


1. Introduction and statement of results 

Consider a positive integer m ^ 1. Let p : 7i/mfL —» C be a complex-valued function 
defined modulo m, which we also view as a function Z —> C by composing with the 
reduction modulo m. Let / C Z be an interval of cardinality |/| at most equal to m. One 
of the most important general problems in analytic number theory is to estimate the partial 
sum 

n£l 

Of course, one has the obvious upper bound 

|SW)I ^ IMU/|, (i.i) 

where 

IMloo — max \<p(n)\ 

n^Z/mZ 

but the goal is usually to improve significantly this bound when (p is an oscillating function. 
Both the quality of the improvement (i.e., the saving compared with the trivial bound for 
given I) and the range of possibilities for / that give rise to non-trivial bounds are important. 
We will mostly focus on this second aspect in this paper. 
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1.1. The Polya-Vinogradov range. There exists a very general method to estimate in¬ 
complete sums, based on Fourier theory on Z/mZ. This is called the Poly a-Vinogradov or 
completion method. 

For any function p : Z —» C which is m-periodic, we define its normalized Fourier trans¬ 
form (p : Z — y C by 

<P{ h ) = ~7= p ( x ) e (—)> ( 1 - 2 ) 

yjm z J V m / 

tc (mod m) 

for all h E Z, where e(t) := exp(27rit). We shall also find it convenient to use the notation 
e m (t) to denote e(t/m ) = e 2mt ^ m . 

Given an interval / C Z with cardinality at most m, let / be its image in Z/mZ, with 
characteristic function denoted by 1/. The discrete Plancherel formula gives the identity 

S(‘P,I)= E v(h)W): 

h (mod m) 


so that 




oo E 

h (mod m) 


1 f(h) ^ ||^|| 00 m 1/2 log(3m) 


since it is well known that 


E |W) 

h (mod m) 


^ m 1 ^ 2 log(3m) 


(1.3) 


for any m ^ 1 and any interval /. 

Therefore if we assume that ||</?||oo ^ c for some constant c, the bound (1.3) will be 
non-trivial as long as 

|/| ^ cm 1 / 2 log(3m) (1.4) 

which we call the Poly a-Vinogradov range. 

The problem of estimating non-trivially such sums over shorter intervals is crucial for many 
applications, for instance to study averages or subconvexity estimates of L-functions [2, 5] 
(see also [21] for a recent very different situation where this range determines the solution of 
a natural problem). Here we study this problem, starting with a general result which gives 
a modest improvement over the Polya-Vinogradov range (1.4). 


Theorem 1.1. For any interval I in Z with yfm < |/| we have 



n£l 


where c = max(||<d|oo, ||wlloo)■ 


(1.5) 


The estimate (1.5) is non-trivial as soon as / is of length -y/m, and we may view this 
result as “bridging the Polya-Vinogradov gap”. As we will see in Section 2, the proof is very 
simple, but such results do not seem to have been noticed before. While we have given an 
explicit bound in Theorem 1.1, we have not made any attempt to optimize constants, and 
a more careful smoothing argument (for example, using the Beurling-Selberg trigonometric 
polynomials as in [14]) would provide better explicit constants. 
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Before continuing, we note that Theorem 1.1 is essentially best possible, since for p{n) = 
e(n 2 /m) the sum over 1 ^ n ^ m 1 / 2 is m 1 / 2 . Hence any improvement beyond Theorem 1.1 
requires some input on the function p. 


1.2. Beyond the Polya-Vinogradov range. Our next result is concerned with the prob¬ 
lem of going significantly below the range |/| ^ y/m for suitable functions p. There are only 
few results of this type already known, the most famous being the Burgess bound, when 
p{n) = xi n ) is a primitive Dirichlet character. 

We are currently unable to obtain results in great generality, but we will obtain a number 
of new cases by proving a general principle that, roughly speaking, states that if the partial 
sums of a function <p has substantial cancellation near the Polya-Vinogradov range, then so 
does its discrete Fourier transform. We now formulate this principle precisely, giving in the 
next section several applications. 

Suppose throughout that p : Z/mZ —> C is a periodic function with 

c = max(||</?||oo, Halloo). 

For any N ^ 1, we define the sum S(p, N ) by the formula 


S{p,N)= Y fH, 

l^n^N 


and if N ^ — 1, then we put 

S(ip,N) = Y ¥>( n )- 

N^n^—l 

Next define, for any 1 ^ N ^ m/2, 

n) = 7^ + -K, { min (h* Jv) 4)1 + |S(43 ’ - 4) 0 }■ (L6) 

From the definition, it is clear that A (p, N ) is a non-increasing function of N, and also that 
NA(p, N ) is a non-decreasing function of N. Further, the definition immediately gives 

max (jS'Ojf)! + |S , (y3, — ^ cNA(p,N), (1.7) 

and 

max y(|%>,t)| + |%>,-t)|) ^cA(p,N). (1.8) 

m/2^t^N t \ J 

With this notation, our main theorem transfers bounds for A (p,m/N) into bounds for 
A (p,N). 


Theorem 1.2. Let m, p, c and A be as above. For 2 ^ N ^m/2 we have 

|^(^, A)| + |S(£,-fV)| <C cVNrrAA^p^Y, 


and 


In particular 


A(p, N) « 


m 4 


y/N 


A 



1 

2 


A ((p. \fm j -c A (if, , /m ) 2. 


(1.9) 
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If we apply the bound of (1.10) twice, we see that A((p, y/m) -C A(tp, y/rn)^ -C A (Ip, y/m )a, 
so that some loss in precision has occurred. One may wonder if this loss in precision could 
be removed, perhaps by defining some other quantity rather than A. 


1.3. Applications. Our methods apply best to functions modulo m that are pointwise small 
and whose Fourier transform is also small, in a precise quantitative sense. In analytic number 
theory, there is a plentiful supply of such functions which arise naturally in applications: they 
are given by Frobenius trace functions modulo m. 

These functions originate in algebraic geometry, and their analytic properties have been 
investigated systematically in recent years by Fouvry, Kowalski and Michel especially (see 
[8-12] for instance). We will recall briefly the definition in Section 2.2, referring to [13] for 
a longer survey. 

Basic examples of trace functions lead to the following application of Theorem 1.1, where 
we denote as usual by x the inverse of x modulo p for x G F* . 


Corollary 1.3 (Equidistribution over short intervals). Let /3 be any function defined on 
positive integers such that 1 ^ /3{p) —* +oo as p —$■ +oo, and for all p prime, let I p be an 
interval in F p of cardinality \I P \ ^ p l / 2 (3(p). 

(1) Let fi, / 2 G Z[X] be monic polynomials such that f = /i// 2 G Q(X) is not a polyno¬ 
mial of degree 1. Then for p prime, the set of fractional parts 

//Ml 

\ p I’ G/p ’ 


becomes equidistributed in [0,1] with respect to Lebesgue measure as p —$■ +oo, where f(n) = 
fi (n) / 2 (rd) is computed in F p and defined to be 0 if n (modp) is a pole of f. 

(2) For p prime and n G F p (resp. n G F p ), define the Birch (resp. Kloosterman) angles 
6 * 3 iP (n) (resp. 9~i tP (n)) in [ 0 , 7 r] by the relations 


B 3 (n) = — ^2 e p( x3 + nx ) — 2 cos 0 3 iP (n), 

VP 


x£F v 


Vp 


^2 e p (x + nx) = 2 cos 6 i _i iP (t 7 ,). 




Then the angles {0 3 ;P (n), n G I p }, { 6 ) _i )P (n), n G I p — {0}} become equidistributed in [0, n\ 
with respect to the Sato-Tate measure 2n~ l sin 2 9d6. 


Remark 1.4. We use the terminology “Birch angle” as analogous for Kloosterman angles. 
Historically, Birch [1, §3] mentioned the problem of the distribution of these angles as a 
problem similar to the Sato-Tate distribution of the number of points on elliptic curves over 
finite fields. This Sato-Tate equidistribution was subsequently first proved by Livne [22]. 

See Section 2.2 for the proofs, which are direct applications of the Weyl criterion and the 
estimate (1.5). These statements can be generalized considerably to other summands, as 
will be clear from the proof in Section 2.2; there are also variants for geometric progressions 
instead of intervals, which are discussed in Section 2.3. 

Below the Polya-Vinogradov range, we obtain: 
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Corollary 1.5. Let p be a prime number, let P{X) E F P [X] be a non-zero polynomial and 
let x : Fp —> C x be a multiplicative character. Assume that either x is non-trivial or that 
deg P ^ 3. Let 

V>{x) = x(x)e p (P(x)) (1.11) 


and 


<p(n) 


1 


^2 x(x)e p (P(x) +nx). 

x (modp) 


There exists 5 > 0, depending only on deg P such that for any interval I C R with |/| ^ p a 
we have 


E 

nG/ 


p[n) 


<C \I\ 


1-6 


-6 


where the implied constant depends only on deg(P). 


The basic input here is the work of Weyl, Burgess, Enflo, Heath-Brown, Chang and Heath- 
Brown-Pierce on short sums with summands of the type y(x)e(P(x)/p). 

Corollary 1.5 has partial consequences to the distribution properties of the cubic Birch 
sums in shorter intervals than is allowed in Corollary 1.3: 


Corollary 1.6. Let p be a prime number and let B 3 (n), the cubic Birch sum, be as in 
Corollary 1.3. There exists S > 0, such that for any interval I C R with |/| ^ pz~ 5 we have 


^B 3 (u)«|/| 1 - 5 (1.12) 

n£l 


and 

£|B 3 ( n )|’=|/|+0(|/r) (1.13) 

nG/ 

where the implied constants are absolute. Further, for such intervals I, and any 0 ^ t < - 
we have 

mm( E !- E 1 )^(y2^) + ° (1) ) |/| - 

nG/ nG/ v 

B3 [n)>t B3 (n)<—t 

and for any 0 ^ t < 1 we have 


E 

nG/ 

|B 3 (n)|>t 



Using delicate work of Bourgain and Garaev on Kloosterman fractions [4], [3], we obtain 
corresponding, but weaker, results for short sums of Kloosterman sums: 

Corollary 1.7. Let p be a prime number. For k ^ 1 an integer and ( a,p ) = 1 some 
parameter, let 

ip(x) = e p (ax~ k ) 

for ieF p x and 99 ( 0 ) = 0. Let 

e p {ax~ k + nx 

x (modp) 
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be its Fourier transform. 

For k = 1 and k = 2, there exists S > 0 such that for all x ^ y / p(logp) -5 we have 

| Y <C x(logx) -5 , 

l^n^x 

where the implied constant is absolute. 

All these applications of Theorem 1.2 are found in Section 3.2. 

Acknowledgments. We thank the referee for his or her careful reading of the text, and 
especially for pointing out a slip in the proof of Theorem 3.2 in the first version of this paper. 


2. Bridging the Polya-Vinogradov gap 

2.1. The basic inequality: Proof of Theorem 1.1. We may assume that m ^ 64 since 
otherwise the trivial bound cm is better than the claim. In that case we have |~i/mT| ^ m/4. 

For any r ^ 0, we denote by I r — r + / the interval / shifted by r, and by I r the Fourier 
transform of its characteristic function: 

Ir(t) - y ' 6m(h) 


X&I r 


By the discrete Plancherel formula, we have 


S(<p,Ir)= Y v( t ) I r(t)= Y F(t)I(t)e m (-rt) 

t^LlvriL tE'L/m'L 

for any r, where / = J 0 . Moreover, we have 

\S(p,I r ) - S(ip, I) | ^ 2 cr 

since \p(x)\ ^ c for all x. 

Let R = \^/m\. Since y/rri < |/|, we see that R is an integer with ^/m ^ R ^ |/|. Thus, 

|/| ^ R ^ \fm A m/\I\ ^ 1, m ^ 4i? 

(the last inequality because m ^ 64, as assumed at the beginning of the proof). Summing 
our identity for S(p, I r ) for 1 ^ r ^ R, we obtain 


RS(p,I)= Y + 

—m/2<t^m/2 


( 2 , 1 ) 


lsSr^R 


where A ^ 2cR 2 . 

Now, the Fourier transform / satisfies 

l^)K' 


i 


• /1 7-1 m \ 

mm [ 111 , —;—- 
m V 1 2|f|/ 


for —m/2 ^ t ^ m/2, and similarly, we have 

| Y 

1< r<R 


^ min ( R, 


m \ 

w 
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Using these bounds in (2.1), together with R ^ |/| and |<^(£)| ^ c, we get 


R\S( V ,I)\ic{ J2 R 


+ 


E 


m 1 / 2 

R — i . i—b 


E 


m 


3/2 


77^1/2 ^ 2 It I ^^ 4£^ 

|t|^m/(2|/|) m/(2|/|)<|t|^m/(2i?) m/(2R)<|iRm/2 


+ 2cir 


The hrst sum above is at most 


yfm V |/ 

Since m ^ 4i?, the third term is at most 
m 3/2 2 m 3//2 

E 


+ i) 


2 ^ £ 2 
t>m/(2R) 

We claim that the middle term is 


m 3 / 2 R 


2 m/(2R) — 1 m-2R 


U 2 Ry/m. 


Ry/m Y] 


m / (2\I\)<t^m / (2R) 


from which it follows that 


U Ry/m log 


ISYyj, J)| < Acy/rn + 2cR + c^/mlog f£ cy/mlog (-^=^) + 8 cyfm, 

\JmJ \JmJ 


4|/| 


-4|/| 


as desired. 

To verify the claim (2.2), note that if |/| < m/4 then the quantity in question is 


U Ry/m log 


m /(2 R) 


U Ry/m log 


m/ (2 R) 


,m/(2|/|) - 1, 

which verifies (2.2) in this range. If ?n/4 ^ |/| < m/2, then we may use the bound 

Ry/m ~ U Ry/m — ' ^ D / ™ 1 — 1 

l<t^m/(2R) 

which again verifies (2.2). Finally, if |/| ^ m/2, then 

1 

1 


— E R\/m 
2 R 


Ry/m Y f < Ry/m[l + log( — ) ) ^ 

t^m /(2 R) 

which completes our verification of (2.2). 


4|/| 


( 2 , 2 ) 


2.2. Applications to trace functions, I: the additive case. We now recall the definition 
and give some basic examples of trace functions before proving Corollary 1.3. As is usual, we 
will restrict our attention to prime moduli; the extension of the results to squarefree moduli 
at least is a matter of applying the Chinese Remainder Theorem. 

Thus let p be a prime number. Given a prime i / p, we fix an isomorphism l : ~ C, 

and we use it implicitly to identity any Uadic number with a complex number. A Fourier 
sheaf modulo p is defined to be a middle-extension Q^-adic sheaf T on Ap p , that is pointwise 
pure of weight 0 and of Fourier type, i.e., none of its geometric Jordan-Holder components 
is isomorphic to an Artin-Schreier sheaf for some additive character if. 

Remark 2.1. Note that in contrast with the definition of Katz [18], we impose the weight 
0 condition instead of stating it separately. 
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The (Frobenius) trace function of T is the function Z/pZ —> C dehned by 

ip = t(tr(Fr XjFp |J-)) (2.3) 

for any x G F p . It is a deep property, due to Deligne, that the Fourier transform of the 
trace function of T is also a trace function, namely that of the sheaf-theoretic (normalized) 
Fourier transform of T. 

The complexity of the trace function is controlled by the (analytic) conductor of the sheaf 
IF, which is dehned as 

c(J r ) = rank(J r ) + n(IF) + Swan x (J r ), 

xeS(T) 

where rank(J r ) is the rank of IF, S{T ) C P 1 (F p ) is the set of singularities of IF, the integer 
n(J-) ^ 0 is the cardinality of S(IF) and Swan x denotes the Swan conductor at such a 
singularity. The conductor is a non-negative integer, and from its properties (see [8, Prop. 
8.2] for (2.5)) we have the following inequalities 



IMloo ^ rank(J 7 ) ^ c(J') 

(2.4) 

and 


c(^) < 10 c(T) 2 . 

(2.5) 

Thus we obtain 


maxdl^Hoo, Halloo) ^ lOc(J') 2 . 

(2.6) 


This means that, for instance, Theorem 1.1 or Theorem 1.2 can be applied efficiently 
to a sequence of trace functions modulo primes p —* +oo, provided the conductor of the 
underlying sheaves is bounded independently of p. 


Example 2.2. Let f\, f 2 ^ 0, g\ , g 2 ^ 0 be rnonic polynomials with integer coefficients 
such that /i is coprime to / 2 and g\ is coprime to g 2 . Let p be a prime number and y a 
multiplicative character modulo p. If x is trivial, we adopt the convention that f\ — f 2 — L 
If x is non-trivial, we assume that none of the zeros or poles of /i// 2 has order divisible by 
the order of y. Define the rational functions / = /i// 2 and g = gi/g 2 , and let 

¥>(n) = x(/H)e p (g(n)) 

for n G Z such that 

fi(n)f 2 {n)g 2 {n) ± 0 (rnodp), 

where f{n) = /i(n)/ 2 (n) and g{n) = gi(n)g 2 (n) are computed in F p , and let 

ip(n) = 0 

if fi(n)f 2 (n)g 2 (n) = 0 (rnodp). 

For all primes p large enough, the poles of g are of order < p. For any such prime, the 
function p is the trace function of a middle-extension sheaf IF with 

c(T) < deg(/i) + deg(/ 2 ) + degfoi) + deg(^ 2 ), 

where the implied constant is absolute. If / is not constant modulo p or if g is not a 
polynomial of degree at most 1, this sheaf is a Fourier sheaf. 




Proof of Corollary 1.3. (1) We can certainly assume that /3(p) < p 1//2 for all p. By the Weyl 
criterion, we must show that, for any fixed integer h ^ 0, and for the interval I p , the sums 

7T-\J2 e p( h f( n ^ 

]lp ' nel p 

tend to 0 as p —> +oo. For a given p, and a suitable f-adic non-trivial additive character if 
of F p , there exists a rank 1 sheaf T = £^hf(x)) with trace function given by 

p(x) = e p (hf(x)) 

for all x G F p . This is a middle-extension sheaf modulo p, which is pointwise pure of weight 
0. For p large enough so that hf(X) is not a polynomial of degree ^ 1, this sheaf is a Fourier 
sheaf. Its conductor satisfies 

c(JF) 1 T (1 T deg(/ 2 )) + ^ ord x {f 2 ) + deg(f 1 )<^l 

x pole of /2 

for all p large enough (the first 1 is the rank, the singularities are at most at poles of f 2 and 
at oo, the Swan conductor at a pole of f 2 is at most the order of the pole, and at infinity it 
is at most the order of the pole of / at infinity, which is at most the degree of ff). Hence, 
there exists c ^ 1 such that the trace function ip satisfies 

max(H^Hoo, ||£||oo) < c 


for all large p. By Theorem 1.1, we get 


T7T e p( h f ( U )) 

1 pl n&I p 


TT-T S(p,I p ) < 

\ 1 p\ 



•c 


log Pip) 
Pip) 


by assumption. 

(2) Let 9 p = 6 _i p or depending on whether one considers Kloosterman sums or Birch 

sums. Using the Weyl criterion, and keeping some notation from (1), it is enough to show 
that for any fixed d ^ 1, we have 

lim —V Ud (2 cos 9 p (n)) = 0 

p-H-oo / ^' 


where Ud G Z[X] is the Chebyshev polynomial dehned by 

Ud(2 cos 9) = sin((d + 1)9)/ sind. 

By the theory of the Fourier transform of sheaves (see the exposition in [18, Ch. 8] and 
the survey in [8, §10.3]), the function 

p(x) = U d (2cos9 p (x)) 

is the trace function of an Uadic irreducible middle-extension Fourier sheaf (the symmetric 
d-th power of the rank 2 Kloosterman sheaf or of the Fourier transform of the sheaf C^( x 3), 
which is also of rank 2, both of which are irreducible); this sheaf has rank d + 1 ^ 2 on the 
affine line over F p , and its conductor is bounded by a constant depending only on d, and not 
on p. It is therefore a Fourier sheaf with trace function satisfying 


max (IMIoo, Halloo) c 
9 




for some c depending only on d, and hence the desired limit holds again by a direct application 
of Theorem 1.1. □ 

Another interesting and somewhat similar application is the following: 

Proposition 2.3 (Polynomial residues). Let (3 be a function defined on integers such that 
1 ^ (3{m) — y Too as m — > Too. Let f E Z[X] be a non-constant nnonic polynomial. For 
all primes p large enough, depending on f and fi, and for any interval I p modulo p of size 
\Ip\ ^ there exists x E I p such that x = f(y) for some y E F p . In fact, denoting 

by P the set /(F p ) of values of f, the number of such x is ~ 5f\I p \ as p —> Too, where 

5 f = \P\/p. 

Here again, the interest of the result is when fi(m) is smaller than logm. However, it 
seems likely that this distribution property should also be true for much shorter intervals (as 
in the well-known conjecture for quadratic (non)-residues). 

Proof. Let p be the characteristic function of the set P of values f(y) for y E F p . We must 
show that, for p large enough, we have 

Y <p(z) ~ Sf\I p \ 

xG I p 

(which in particular implies that the left-hand side is > 0 for p large enough.) 

By [9, Prop. 6.7], if p is larger than deg(/), there exists a decomposition 

<p( x ) = y ^Cjipfix) 

i 

where the number of terms in the sum and the c t are bounded in terms of deg(/) only, and 
where <pi is the trace function of a tame Tadic middle-extension sheaf Jy with conductor 
bounded in terms of deg(/) only. Moreover, Pi is the trivial sheaf with trace function equal 
to 1, all others are geometrically non-trivial and geometrically isotypic, and 

ci = 8 f TO(jT 1/2 ), 

where 5f = \P\/p and the implied constant depends only on deg(/). In particular, Jy, being 
tame and geometrically isotypic and non-trivial, is a Fourier sheaf for i 1. We also note 
that 8f 1 for primes p > deg(/). 

This decomposition implies 

Y. f( x ) = Cl I ip I T = 8f\Ip\ T + °(p _1/2 I4I)- 

x£lp 

Since the Pi, for i 1, are Fourier sheaves, we get by Theorem 1.1 

S(p,I P ) « v^log(i^) < |/ F | y , 

for each i 1, where the implied constant depends only on deg(/). Hence we obtain 

Yf( x ) ~ 8f\I P \ 

X^Ip 

uniformly for p > deg(/), since j3(p) —$■ Too, which gives the result. □ 
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2.3. Applications to trace functions, II: the multiplicative case. We consider now 
a different application of the basic inequality: for a prime p, we look at the values of trace 
functions modulo p on the multiplicative group A = F* ~ Z/(p — 1)Z. Fixing a generator g 
of A, this means that we are now looking at sums over geometric progressions xg n for n in 
some interval / in Z/mZ = Z /(p — 1)Z. Such sums have also been considered by Korobov, 
for instance (see, e.g., [20, Ch. 1, §7]). 

We will use the notation and terminology of the previous section, but to avoid confusion 
we write tt for the restriction of the trace function of a sheaf T to F* . The discrete Fourier 
transform becomes the discrete Mellin transform 

Tr(x) = y==j Y r A x )x( x ) 

z€Fp 

defined for x m the group of multiplicative characters of F*. (More precisely, this Mellin 
transformed can be identified with the discrete Fourier transform on F* ~ Z /{p — 1)Z; as 
we are interested in bounds for the maximum of the Fourier transform, we may as well use 
the multiplicative characters as arguments). 

The analogue of Fourier sheaves in this case are the sheaves with “property V ” of Katz’s 
work on the discrete Mellin transform [19, Chapter 1], 


Proposition 2.4. Let p be a prime number, and let IF be an t-adic middle extension sheaf 
modulo p with conductor c, pointwise pure of weight 0. If no geometric Jordan-Holder com¬ 
ponent of T is isomorphic to a Kummer sheaf C x associated to a multiplicative character 
X, then the Mellin transform of the trace function Tjr is bounded by 2a/2 c 2 , i.e., for any 
character x °f F*, we have 


1 

VP~ 1 


Y Tjr(x)x(x) 

xeFp 


< 2a/2c 2 . 


Proof. This is again a form of the Riemann Hypothesis of Dcligne. By the Grothendieck- 
Lefschetz trace formula, and the assumption on IF which ensures the vanishing of Hf, we 
have 

Y r A x )x( x ) = —Tr(Frob Fp | H]{ G m x F p , J 7 ® C x )). 

xGFp 

Then, since the sheaf involved is pointwise mixed of weight 0 on G m , Dcligne’s Theorem 
implies that each eigenvalue of the Frobenius acting on H\ (G m x F p , T ® C x ) has modulus 
at most y/p. Thus 


Y t A x )x(x) 

xgFp 


< (dim H l c (G m xF p ,J® £ x ))) v / p. 


Let U C G rn be the maximal dense open subset where IF is lisse. By the Euler-Poincare 
characteristic formula, we have 


dim H'(G rn xF p ,J® C x ) = -y c ( G m x F P ,IF® C x ) 

= Swan 0 (J r <8) C x ) + Swan 00 (J r ® C x ) + ^ (drop a ,(J : ' (g) C x ) 

X£{G m -U) 


li 


+ Swan a ,(J r ® C x )) 









(see, e.g., [19, p. 67]). Since C x is tame of rank 1 , we have Swan 2 .(J r 0 C x ) = Swan^J 7 ) for 
all x, and therefore 

Swan 0 (J r 0 C x ) + Swan 00 (J r 0 C x ) + Swan a ,(J r 0 C x ) ^ c(J r ) = c. 

X&{Gm~U) 

Furthermore, we have drop x (.F 0 C x ) ^ rank(J r ) ^ c for all x, and at most c points occur 
where it is non-zero. Thus we derive 


dim H l r (G m x F p , J 7 0 £ x ) ^ c + c 2 ^ 2c 2 . 
Finally, since yfp ^ a /2y/p — 1 for all primes p, we deduce the result. 


□ 


Corollary 2.5. Let c ^ 1, /etp 6e a prime number, and let T be an t-adic middle extension 
sheaf modulo p, pointwise pure of weight 0, with no Kummer sheaf as a geometric Jordan- 
Holder component. Assume that the conductor of T is ^ c. 

Let g G F^ be a generator ofFf and let x G F* be given. For any interval I in Z/(p—1)Z, 
we have 


^Tj:(xg n ) ^ 2 v / 2 c 2 v / p- llog 


nel 


4e 8 


Vp- 1 


This follows immediately by combining Proposition 2.4 and Theorem 1.1 with m = p — 1. 
From this result, we can deduce equidistribution statements exactly similar to Corol¬ 
lary 1.3, with geometric progressions replacing intervals, since the sheaves used in the proof 
are in fact geometrically irreducible and are not Kummer sheaves. 

Proposition 2.3 also extends with some restriction on the polynomial: 


Proposition 2.6. Let (3 be a function defined on integers such that 1 ^ f3{n) — > +oo as 
n — y Too . Let f G Z[X] be a non-constant squarefree monic polynomial. For all primes p 
large enough, depending on f and /3, for any primitive root g modulo p, and for any interval 
I p in Z /{p — 1)Z of size |/ p | ^ (p — l) 1//2 /3(p), there exists n G I p such that g n = f(y) for 
some y G F*. 

The only change in the proof of the previous case is that we must ensure that the sheaves 
Ti for i 7 ^ 1 appearing there have no Kummer sheaf as Jordan-Holder component, for p large 
enough. This is indeed the case because the assumption that / is squarefree implies first 
that / is squarefree modulo p for p large enough, and from this follows for each such prime 
that each J 7 is lisse at 0 (see [9, Prop. 6.7]), which is not the case of C x . 


3. The Fourier transfer principle 

3.1. The basic principle. In this section we provide a quantitative version of the transfer 
principle discussed in Section 1.2. The idea is to estimate a sum 

tel 

by beginning with an application of the completion method in a smooth form, followed 
however by a summation by parts that allows us to exploit bounds for short sums of the 
original function <p. 
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Proposition 3.1. Let m ^ 2 be an integer and let 

ip : Z/mZ —» C 

6e an arbitrary function. Let c ^ 1 be such that 

maxGMIoo, Halloo) ^ c. 

For any N with |iV| ^ m/2 we have 

TT N cN N f m/2 a, lo/ \ . fN 1 1 \ 

Ar) 1 « cm[/++ ^ + 7^ l O ' s{<p ' t] 1 +_t) 0 111111 w ? ¥ip) dt ’ 

where the implied constant is absolute, and for any U G [1/m, N/ {2m)}. 


Proof. We will view ip as a function on Z which is periodic modulo m. We consider the case 
of positive N with 1 ^ N ^ m/2, the negative case being entirely similar. 

Let U G [1/m, N/(2m)\ C (0,1/4] be some parameter. We fix a smooth function 4' : 
[0,1] —> [0,1] with compact support contained in [U, N/m + U\ C [0,1], such that 

• For x G [2 U,N/m] we have 4/(re) = 1. 

• The function 4/ is increasing on the interval [U, 2 U] and decreasing on the interval 
[N/m, N/m + U], 

• For any integer l ^ 0, we have 

T w (a) U~ l . 


We extend 4/ to a 1-periodic function on R and consider its Fourier expansion 


4/(a) = 4 '/(h)e(-ha), 

he z 


(3.1) 


with 


T(h) 



4 f (a)e(/ia)da. 


(3.2) 


The properties of the derivatives of 4/ immediately imply the following bounds for its 
Fourier coefficients 

N - 1 /N\ 

T(0) =-+ 010 = 0-1 (3.3) 

m Vm/ 

and 

T(h) <,4 m m(^, J^( u \ h \)~ A ) ( 3 - 4 ) 

for any /i / 0 and any A ^ 0. Indeed, this follows from the definition of the Fourier 
coefficients using repeated integrations by parts and the fact that 4/ is supported in an 
interval of length <C N/m, whereas the derivatives of 4/ are supported in the union of two 
intervals of length U. 

Now, we observe furthermore that the expression (3.2) defines a smooth function of h on 
the whole real line (namely, the Fourier transform of 4/ seen as a function on R). We have 
then 

4 t'(h) = 2ir i 


/ N\ 2 

a^{a)e(ha) da -C —) , 
Vm/ 


'[ 0 , 1 ] 
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(3.5) 







for all h and moreover, for h f 0, we have (after integrating by parts A ^ 1 times as in 
(3.4)) 


&(h) 


N 

m\h\ 


(U\h\) 


-A +1 


(3.6) 


We now begin the estimation of the partial sums of (p. We have 

<p(n)= (ptn)*Q+0(\\(pUmU) 

0^n<m. 

= m 1 / 2 E ^f(h)tp(h) + 0(cmh), 


l<n^N 


(3.7) 


he z 


where the implied constant is absolute, and where the second step (a version of the Plancherel 
formula) follows upon using the Fourier expansion (3.1). 

By (3.3), the term h — 0 in (3.7) equals 


y/mip( 0) 


N — 1 


+ 


m 


•C 


N cN 
•C 


m 


m 


Next consider the contribution of the positive values of h. By partial summation, these terms 
contribute 


—yfm J S(<p, t)^\t)dt 

N 2 rm/N 

/ 

\m z /1 


\S(ip,t)\dt + 


" f' /u \S(<P,t )\, N 


dt 


m 


'm/N 


mU 3 


h/u 


|£(y^)| 

t 4 


dt ), 


upon using the estimate (3.5) in the range t ^ m/N , and the estimate (3.6) with A = 1 in 
the range m/N ^ t ^ 1/U and with A = 4 when t > 1/U. An analogous estimate holds for 
the contribution of negative h to (3.7), and gathering these estimates together we obtain 

|S(^. N)\ « cmU + c//= + N/ (|S(P, 01 + |S(7>, ~‘)l) min (A / fjif. 

To complete the proof the proposition, it remains to bound the portion of the integral with 
t > m/2. Write t > m/2 as t = u + km where k ^ 1 is an integer, and \u\ ^ m/2. Then by 
dividing the intervals [1,7] and [— t , —1] into complete intervals of length m with intervals of 
length \u\ left over we see that 

|S((p,f)| + |S'(^,-t)| <C |S'(^,m)| + |S'(^,-u)| + 


since the sum of (p over a complete interval is ^ Cyfm in size, ft follows that the terms 
t > m/2 in the integral contribute 

n ( r°° ct dt r /2 nci , m , 0 / 1 , ^ 


•C 


•C 


cN 


N 


m 3 U 3 yfm 

The proposition follows. 


rm/2 J 

(|%?,u)| + |%>,-u)|) au 


m 4 f/ 3 ' 


□ 


We are now ready for the proof of Theorem 1.2. 

14 



















Proof of Theorem 1.2. We first demonstrate (1.9), by an application of Proposition 3.1. To 
estimate the integral in Proposition 3.1, we bound \S((j), t)\ + \S((p, — f)| for 1 ^ t ^ m/2 by 


max 

t^m/N 


and 


max 


j(l%h*)l + l%>,~*)l) ^ cA(</?, . 


Thus, it follows that 


N cN 

\S{(p,N)\ + \S{$, -N)\^cmU + c— + 


m m 3 U 3 


+ c 


A 1 / m\ 

~^c7 A w’a/' 


Now choose A = Asm -4 A(yj, m/N) 2 ; since A(</?, m/A) ^ 1/y/rri, it follows that U ^ 
Nzm~ l ^ 1/m, and we may also assume that £/ ^ A/(2m) else the estimate (1.9) holds 
trivially. Thus our bound above applies, and it gives (noting that A (ip,m/N)/(-^mU) ^ 
l/(mU) ^ l/(mf/) 3 ) 


l‘S'(£,A)| + |S(£,-A)| «c^VmiA^, — j 2 +c 


A 


•C c\/lVm 4 A^, — 


Thus we have established (1.9), and with it in hand, it is a simple matter to verify the 
second assertion of the theorem. If t ^ A, then by (1.9) we obtain 


— ( S((p,t) + |s(£,-t)|) <C 


y/t'. 


m 4 


A 




m\ 2 


•C 


m 4 


Vn 


m\ 2 




since A (ip,m/t) is a non-decreasing function of t. Finally if A ^ t ^ m/2 then 


ct 


1 

m 4 


— (|5'(^,t)| + |S'(^, -t)|) < - 77 A(¥>, — =m 4 — A l(p, — 


Vt 


m\ 2 


_i fm 


t J 


V t 


m 


_i fm.f m 
< m 4 —A 02 , — 
VA A 


since ( m/t)A((p,m/t ) is a non-increasing function of f. Combining these estimates, and 
noting that 1 / y/rn is smaller than (m 4 /\/A)A((^, m/A)a, we obtain the theorem. □ 


3.2. Applications. We will prove quantitative versions of Corollary 1.5, in the sense of 
specifying the value of the quantity <5 > 0 that appears there. The argument splits naturally 
in two cases, depending on whether the character y in (1.11) is trivial or not. We begin with 
the former case, where we can in fact work with an arbitrary squarefree modulus m ^ 1 . 

Theorem 3.2. Let m ^ 1 be a squarefree integer. Let P € (Z/mZ)[A] be a polynomial of 
degree d ^ 3 with invertible leading coefficient. Let p : Z/mZ — > C be defined by 

<p(n) = e m (P(n )) 

and let 

T( n ) = ffpfl e m (P(h) + nh) 

be its Fourier transform. 

For any rj < 1 /(2 rf — 2) there exist S > 0 depending only on g such that if A ^ m 1 / 2- ^, we 
have 

Y. 

l^n^N 
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where the implied constant depends only on p and d. 


Proof. We begin by noting that a combination of the Weil bound for exponential sums with 
additive characters and of the Chinese Remainder Theorem shows that 

\(p(t)\ < (d- 1 ) W M <m £ 

for any £ > 0 and any t G Z/mZ, where the implied constant depends only on £ and d (we 
use here the fact that P is of degree d modulo any prime divisor of m, since we assume that 
the leading coefficient is invertible modulo m). In particular, we get 

c = maxdl^Hoo, Halloo) < m £ . (3.8) 

Let k = l/2 d ~ 1 . The key ingredient is the Weyl bound 


E ^(P(h)) 


<C H 


l+£ 


1 m \ K 

77 + W) 


(3,9) 


valid for an arbitrary £ > 0 and 1 ^ H ^ m, with an implied constant that depends only 
on d and £ (see [17, Lemma 20.3] or [23, Lemma 2.4], and for recent bounds that are much 
stronger for large d, see [24, Theorem 1.5]). This implies that for 1 ^ t ^ m/2, we have 

l‘%>,±*)l <min(jf|,|f| 1+£ (^ + ^) ) 

for any e > 0, where the implied constant depends on d and £ only. By (1.6), this leads to 


A (*>,#)« 


m 

H- 

m c 


1 m K 
H K H dK 


for any £ > 0 , where the implied constant depends on d and £ only. 
Appealing to (1.9) from Theorem 1.2, we conclude that 


. , / AT K AT dn \ 

V' <p(n) < cN 1/2 + c 1/2 N^m J+e ( - 1 - mr) 

' \m K m (<*-!)«/ 


n^.N 


which, with a small calculation using (3.8), yields the theorem. □ 

When the character y in (1.11) is non-trivial, we will need to assume that the modulus is 
prime. 

Theorem 3.3. Let p be a prime number. Let P € F P [X] be a polynomial of degree d ^ 0, let 
X he a non-trivial multiplicative character modulo p and let m G F p . Define Lp : F p —> C 
by 

<p{n) = xi n + m)e p (P(ri)) 

and let 


<p(n) = ~J/2 + m ) e r( P ( /l ) + nh "> 


p 


l^h^p 


be its Fourier transform. 

For any g < 8 ^ 2 ^ d+1 ) there exists S > 0 such that for all N with N ^ we have 

E £(")« n 1 -*, 

l^n^N 

where the implied constant depends only on 77 and d. 
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Proof. The method is similar to the previous case. However, we now use instead of the Weyl 
bound a recent result of Heath-Brown and Pierce, namely 


^2 x(n + m)ep(P(n)) 

l^nsJiV 


•C d, r (logp ) 2 min ( p 1/2 ,p~^^~N 


T 1 - 


where D = d(d + l)/2 (see [16, Th. 1.2], noting that the bound is stated there only for 
N ^ g 1 / 2+1 / 4r ; but that it becomes weaker than the Polya-Vinogradov bound when N ^ 

p l/2+l/4r)_ Thus 

A (<p,N) <C d, r (log p) 2 p~bb~ N~r. 

Applying (1.9) of Theorem 1.2, we obtain (for N > p^) 


n^.N 


(p(n) 


<C p ~ 4 




b ]\fh~ 


(log p). 


Choosing r = 2(D + 1), the theorem follows. 


□ 


Remark 3.4. The bound of Heath-Brown and Pierce is the latest of a series of works by 
Enflo, Heath-Brown and Chang, see [6,7,15], any one of which would lead the qualitative 
form in Corollary 1.5. 


We now consider the special case of the cubic Birch sums to prove Corollary 1.6. 

Proof of Corollary 1.6. Let p be prime and let ip be defined on F p by 

<p(h) = e p (h 3 ). 

As before, we denote by 

B 3 (ra) = ~T/2 S e P ^ 3 + nh ^> 

P l^/isgp 

its Fourier transform. Note that for any fixed m 6 Z, the shifted Birch sums n t—>■ B 3 (n + m) 
is also a Fourier transform of a polynomial, namely of 

h 1 —y e p {h 3 + hm ). 

Thus the estimate (1.12) for the first moment is a special case of Theorem 3.2: for any 
77 < 1 / 6 , there exists 6 > 0 , depending only on 77 , such that 

^ B 3 (n) « N 1 - 5 

m^n^m+AT 

for all m G Z, provided N > p 1 / 2 - 71 . 

We now consider the second moment. We assume p ^ 5 and we define 

fj{n) = |B 3 (n )| 2 - 1, 

and compute its Fourier transform. For any h G F* , we have 
${h) = d B 3 ( n )l' - 1 )ep{nh) 

P nGFp 

= i £ e p (u 3 -u 3 + n(u-u + h)) =-^ e p (u 3 -(u + h) 3 ). 

“ u,v,neF p " ug F p 
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This is a quadratic complete sum, hence it can be evaluated exactly. Precisely, we obtain by 
completing the square the formula 

$(h) = £ p (^je p (—h 3 / 4 ) 

for h e Fp, where e v is a complex number of modulus one independent of h. In fact, the 
same formula holds for h = 0 , as one checks immediately by a similar computation. 

By the discrete Fourier inversion formula, we deduce that ^ is the Fourier transform of 
the function n H* £ p (^)e p (n 3 /4). Hence Theorem 3.3 implies (after using an additive shift 
by m as above to get the estimate for any interval) 

£ (|B 3 (n )| 2 — 1) « N~ s 

771^71^771+N 

for any 77 < 1/104 and some <5 > 0 depending only on 77 , which is more precise than (1.13). 

We now prove the final part of Corollary 1.6 concerning the distribution of values of Birch 
sums. Let 77 < 1/104 be fixed and let / be an interval in Z with |/| ^ p 1 / 2 ' 77 . From our work 
above, we know that for some S > 0 (depending only on rf) we have 

(l + 0(p- i ))|/| = ^|B 3 (, l )| 2 <i 2 ^ 1 + 4 £ 1 , 

nE/ nE/ nE/ 

B 3 (n)e[0,t] B 3 (n)^[0,t] 

where we have used the Weil bound |B 3 (n)| ^ 2. With a little rearranging, this yields 

£ 1 «(jV7i + °( 1 ))l / l- (3.10) 

nE/ 

B 3 (n)e[0,7] 

On the other hand, since 63 ( 77 .) = o(|/|) we have 


£ B 3 (n ) 2 ^ 2 ]T |B 3 (ra)| — 2 B 3 (n)+o(|/|) < 21 ]T 1+4 £ l+o(|/|), 


nS/ 

B 3 (n)<0 


n£l 

B 3 (n)<0 


n£/ 

B 3 (n)^0 


n£l n£l 

B 3 (n)G[0,t] B 3 (n)G(t,2] 


and therefore 

(l + 0 (l))|/| = £|B 3 (n)|-^ '^2 1 + 4 ^2 l + 2t ^2 1 + 4 X/ 1 + °(I / D- 


nE/ 


nE/ nE/ 

B 3 (n)E[ 0 ,t] B 3 (n)E(t, 2 ] 


nE/ nE/ 

B 3 (n)E[ 0 ,t] B 3 (n)E(£, 2 ] 


Combining this with (3.10), we deduce that (for t < 1/2) 


E 1 

ng7 

B 3 (n)>i 


>-li¬ 


st 

2 -t 


+ o(l) )\I\ = 


1 - 2 1 
4(2 - 1) 


+ °(1)) /I) 


as desired. The bound on the number of 63 ( 77 ) < —t is obtained similarly. The last bound 
on the frequency of n with 163 ( 77 ) | ^ t follows upon noting that 


/ 


~^|B 3 (n)| 2 $f 2 E 1 + 4 E 1 

nE/ nE/ nE/ 

|B 3 (n)|<t |B 3 (n)|>t 


t 2 \I\ + (4 — t 2 ) ^2 !• 

nel 

|B 3 (n)|>i 


□ 


Finally, we prove Corollary 1.7. 
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Proof of Corollary 1.7. Let p be prime and a € F*. Let k = 1 or k = 2, and define 

(p(h) = e p (ah ~ k ) 

for he Fp and 99 ( 0 ) = 0. Then 

<p(n) = —L Y e p (ah~ k + nh ). 


P 




In [3,4], Bourgain and Garaev have obtained non-trivial estimates for sums of ip. Precisely, 
they proved that there exist absolute constants <5 > 0 and 0 < 77 < 1 such that 


max 

ciE F n 


\-S 


(3.11) 


IX Y e p (ah fc ) <C H(\ogp)~ 

^ l^h^H 

provided H ^ exp((logp) r? ). Thus, if H ^ exp((logp) ?? ), then A (<p,H) <C (logp) -5 . 
Applying (1.9) of Theorem 1.2, we obtain for p? (logp )~2 $7 N ^ p 2 (logp) 2 , 

S(p,N) -C VNp* (logp ) _ 2 N(logp)-*. 

Since this bound holds also in the range N ^ p^(logp ) 2 by Polya-Vinogradov, our proof is 
complete. □ 
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